Search for: All records

Creators/Authors contains: "Osher, Stanley J."

« Prev Next »

Total Resources

6

Resource Type
Conference Paper

2

Conference Proceeding

0

Dataset

0

Journal Article

4

Workshop Report

0

Availability
Full Text / Resource Available

6

Citation Only

0

Save Results
Excel (limit 2000)
CSV (limit 5000)
XML (limit 5000)

Have feedback or suggestions for a way to improve these results?
!

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Scheduled Restart Momentum for Accelerated Stochastic Gradient Descent

https://doi.org/10.1137/21M1453311

Wang, Bao ; Nguyen, Tan ; Sun, Tao ; Bertozzi, Andrea L. ; Baraniuk, Richard G. ; Osher, Stanley J. ( June 2022 , SIAM Journal on Imaging Sciences)

Full Text Available
Laplacian Smoothing Stochastic Gradient Markov Chain Monte Carlo

https://doi.org/10.1137/19M1294356

Wang, Bao ; Zou, Difan ; Gu, Quanquan ; Osher, Stanley J. ( January 2021 , SIAM Journal on Scientific Computing)
null (Ed.)
Full Text Available
Determining the three-dimensional atomic structure of an amorphous solid

https://doi.org/10.1038/s41586-021-03354-0

Yang, Yao ; Zhou, Jihan ; Zhu, Fan ; Yuan, Yakun ; Chang, Dillan J. ; Kim, Dennis S. ; Pham, Minh ; Rana, Arjun ; Tian, Xuezeng ; Yao, Yonggang ; et al ( April 2021 , Nature)
null (Ed.)
Full Text Available
MomentumRNN: Integrating Momentum into Recurrent Neural Networks

Nguyen, Tan M. ; Bertozzi, Andrea L ; Osher, Stanley J ; Wang, Bao ( January 2020 , 34th Conference on Neural Information Processing Systems (NeurIPS 2020), Vancouver, Canada)

Designing deep neural networks is an art that often involves an expensive search over candidate architectures. To overcome this for recurrent neural nets (RNNs), we establish a connection between the hidden state dynamics in an RNN and gradient descent (GD). We then integrate momentum into this framework and propose a new family of RNNs, called MomentumRNNs. We theoretically prove and numerically demonstrate that MomentumRNNs alleviate the vanishing gradient issue in training RNNs. We study the momentum long-short term memory (MomentumLSTM) and verify its advantages in convergence speed and accuracy over its LSTM counterpart across a variety of benchmarks. We also demonstrate that MomentumRNN is applicable to many types of recurrent cells, including those in the state-of-the-art orthogonal RNNs. Finally, we show that other advanced momentum-based optimization methods, such as Adam and Nesterov accelerated gradients with a restart, can be easily incorporated into the MomentumRNN framework for designing new recurrent cells with even better performance.
more » « less
Full Text Available
DP-LSSGD: A Stochastic Optimization Method to Lift the Utility in Privacy-Preserving ERM

Wang, Bao ; Gu, Quanquan ; Boedihardjo, March ; Wang, Lingxiao ; Barekat, Farzin ; Osher, Stanley J. ( January 2020 , Mathematical and Scientific Machine Learning Conference)

Full Text Available
A machine learning framework for solving high-dimensional mean field game and mean field control problems

https://doi.org/10.1073/pnas.1922204117

Ruthotto, Lars ; Osher, Stanley J. ; Li, Wuchen ; Nurbekyan, Levon ; Fung, Samy Wu ( April 2020 , Proceedings of the National Academy of Sciences)

Mean field games (MFG) and mean field control (MFC) are critical classes of multiagent models for the efficient analysis of massive populations of interacting agents. Their areas of application span topics in economics, finance, game theory, industrial engineering, crowd motion, and more. In this paper, we provide a flexible machine learning framework for the numerical solution of potential MFG and MFC models. State-of-the-art numerical methods for solving such problems utilize spatial discretization that leads to a curse of dimensionality. We approximately solve high-dimensional problems by combining Lagrangian and Eulerian viewpoints and leveraging recent advances from machine learning. More precisely, we work with a Lagrangian formulation of the problem and enforce the underlying Hamilton–Jacobi–Bellman (HJB) equation that is derived from the Eulerian formulation. Finally, a tailored neural network parameterization of the MFG/MFC solution helps us avoid any spatial discretization. Our numerical results include the approximate solution of 100-dimensional instances of optimal transport and crowd motion problems on a standard work station and a validation using a Eulerian solver in two dimensions. These results open the door to much-anticipated applications of MFG and MFC models that are beyond reach with existing numerical methods.

more » « less